REMARKS 

Claims 78-85, 96-103, 114-121, 132-139, previously withdrawn are now cancelled. Claims 
68, 86, 104 and 122 have been amended, as suggested by the Examiner, to recite that the variant 
"has alpha-amylase activity." Claims 68, 86, 104 and 122 have also been amended to remove the 
phrase "(using SEQ ID NO:3 for numbering)" as this is redundant in view of the claim language "at 
position equivalent to positions... in SEQ ID NO:3." 

Claims 140-155 have been added. New Claims 140-147 correspond to the allowable 
subject matter identified on page 10 of the Office Action. In particular, on page 10 of the Office 
Action, the Office states that claims 76, 77, 94, 95, 1 12, 1 13, 130 and 131 were objected as being 
dependent upon a rejected base claim, but would be allowable if rewritten in independent form 
including all of the limitations of the base claim and any intervening claims. 

New claims 148-155 are directed to variants a parent a Bacillus stearothermophilus 
alpha-amylase wherein the variant is at least 90% or at least 95% homologous to the parent 
Bacillus stearothermophilus alpha-amylase and comprises one of the claimed pair wise 
deletions. Support for these claims is found, inter alia, in the specification at page 10, third and 
fifth paragraphs, and page 13, lines 13-27. Claims 148-155 are also believed to be allowable 
for the reasons supporting the allowability of new claims 140-147. 

The specification is amended, as requested by the Examiner, to identify the application 
no. for U.S. Patent No. 6,093,562, and to provide the status for USSN 09/354,191. 

Certified copies of the four Danish priority documents are enclosed. 

An information disclosure statement is submitted herewith. 

It is respectfully submitted that the present amendment presents no new issues or new 
matter and places this case in condition for allowance. Reconsideration of the application in view 
of the above amendments and the following remarks is requested. 

I. The Rejection of Claims 68-72, 86-90, 104-108 and 122-126 under 35 U.S.C. 112 

Claims 68-72, 86-90, 104-108 and 122-126 are rejected under 35 U.S.C. 112, first 
paragraph, as lacking adequate written description support. The Office states that the 
specification does not adequately describe the claimed genus of variants of a parent alpha- 
amylase having at least 80% identity to the referenced alpha-amylase and having a deletion of 
amino acid residues equivalent to positions 180 and 181, 179 and 181, 179 and 182 or 180 or 
182. The Office suggests amending claims 68, 86, 104 and 122 to recite "wherein said variant 
has alpha-amylase activity, has at least 80%...." 
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Applicants have amended the claims as suggested by the Examiner to recite that the 
variant has alpha-amylase activity, and respectfully submit that claims 68-72, 86-90, 104-108 and 
122-126 now fulfill the written description requirement of the Patent Code. 

The test for determining compliance with the written description requirement is whether 
the disclosure of the application reasonably conveys to the artisan that the inventor had 
possession of the claimed subject matter at the time the original application was filed. See In re 
Kaslow, 217 USPQ 1089, 1096 (Fed. Cir. 1983). A specification complies with the written 
description requirement if it provides "a precise definition, such as by structure, formula, chemical 
name, or physical properties of the claimed subject matter sufficient to distinguish it from other 
materials." See, e.g., University of California v. Eli Lilly and Co., 43 U.S.P.Q.2d 1398, 1404 (Fed. 
Cir. 1997); Enzo Biochem v. Gen-Probe Inc., 63 U.S.P.Q.2d 1609, 1613 (Fed. Cir. 2002). A 
description of a claimed genus may be achieved by recitation of a representative number of 
species falling within the scope of the genus or by a recitation of structural features common to 
the members of the genus which constitute a substantial portion of the genus. See Eli Lilly and 
Co., 43 U.S.P.Q.2d at 1569. 

As presently amended, claims 68-72, 86-90, 104-108 and 122-126 now clarify that the 
variants have alpha-amylase activity. It is respectfully submitted that the disclosure of the 
present application provides a sufficient written description such that one skilled in the art would 
conclude that the Applicants were in possession of the claimed alpha-amylase variants at the 
time of filing. As disclosed in the specification, the claimed deletions can be applied to alpha- 
amylases having "at least 80% homology" to the reference sequences. See, e.g., the 
specification at page 4, lines 7-14. Describing a polypeptide by its sequence similarity to a 
reference sequence is well-accepted in the art as a suitable method for describing the 
polypeptide's structure and function, as polypeptides having very high degree sequence similarity 
of at least 80% homology to a reference polypeptide, are reasonably expected to have both a very 
similar structure and function/activity as the reference polypeptide. 

Applicants have also described that the deletions are applicable to other alpha-amylases 
besides the reference sequences. Such alpha-amylases include variant alpha-amylases that 
have been altered at other positions, as described in the specification at pages 12-17. 

In sum, Applicants' description that the claimed invention is applicable to those 
polypeptides which have a degree of homology of at least 80% to the reference sequence and 
Applicants' identification of representative examples, is sufficient for the artisan to conclude that 
Applicants had possession of the claimed invention as it provides recitation of structural features 
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common to the members of the genus and representative members falling within the claimed 
genus. 

For the foregoing reasons, Applicants submit that the claims overcome this rejection under 
35 U.S.C. 112. Applicants respectfully request reconsideration and withdrawal of the rejection. 

II. The Rejection of Claims 68-75, 86-93, 104-111 and 122-129 under 35 U.S.C. 112 

Claims 68-75, 86-93, 104-111 and 122-129 are rejected under 35 U.S.C. 112, as lacking 
enablement. This rejection is respectfully traversed. 

Section 112 of U.S. Patent Code requires that the specification be "enabling" to a person 
skilled in the art to which the invention pertains. Enablement requires that the specification teach 
those in the art to make and use the invention without undue experimentation. In re Wands, 8 
USPQ2d 1400 (Fed. Cir. 1988). 

It is respectfully submitted that the claims are enabled for alpha-amylase variants which 
are at least 80% homologous to the reference sequence and comprise one of the pair wise 
deletions recited in the claims. The specification provides examples of representative species 
falling within the scope of the claimed invention in the form of suitable alterations which can be 
made in combination with the claimed pair wise deletions. See the specification at pages 12-17. 
In addition to these representative examples, as of the filing date of the present application, many 
Bacillus stearothermophilus alpha-amylases variants were known in the art, which provided 
information on important conserved sequences as well as sequences which could be altered 
without destroying the alpha-amylase activity. See, e.g., Holm et al., "Random mutagenesis used 
to probe the structure and function of Bacillus stearothermophilus alpha amylase", Prot. Eng., Vol. 
3, No. 3, pp. 181-191 (1990), Vihinen et al., "Site-directed mutagenesis of a thermostable alpha- 
amylase from Bacillus stearothermophilus: putative role of three conserved residues, J. Biocem, 
107(2):267-72 (1990), attached as Exhibits A and B, respectively. 

Consistent with the Holm et al. and Vihinen et al. publications, the specification discloses 
that alpha-amylases falling within the scope of the claimed invention may be prepared by site 
directed mutagenesis and random mutagenesis, including localized random mutagenesis. See 
the specification at page 18-24. Although the production and identification of suitable variants 
falling within the scope of the claimed 80% homology would involve some experimentation, it is 
respectfully submitted that such experimentation is not undue. In particular, both the task of 
producing such sequences and the task of screening the sequences for variants with alpha- 
amylase activity and/or improved alpha-amylase activity was routine in the art as of the time of the 
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invention. See id. The artisan could, e.g., rapidly produce and screen thousands of alpha- 
amylase variants and identify those variants which possess alpha-amylase activity using the 
processes of site directed mutagenesis, random mutagenesis and localized random mutagenesis 
following by an alpha-amylase activity assay as described in the specification at page 54 
("Determination of a-amylase activity"). 

In addition, although the three-dimensional models available at the time of the present 
invention were generally fungal based alpha-amylase models, which had some significant 
limitations in the application to Bacillus stearothermophilus alpha-amylases, they did provide some 
insight into conserved structural features common to alpha-amylases in general. See Holm et al. 
and Vihinen et al. (modeling a Bacillus stearothermophilus alpha-amylase on the basis of the 
fungal TAKA alpha-amylase structure). Thus, although these models were not accurately 
representative of the complete structure of Bacillus stearothermophilus alpha-amylases, they did 
provide some guidance into a number of conserved structural features of the molecule which an 
artisan would preserve when making other alterations. 

Accordingly, it is respectfully submitted that the present invention is enabled for making 
alpha-amylase variants having a claimed pair wise deletion and which are at least 80% 
homologous to the reference sequence. 

For the foregoing reasons, Applicants submit that the claims overcome this rejection under 
35 U.S.C. 112. Applicants respectfully request reconsideration and withdrawal of the rejection. 

III. Conclusion 

In view of the above, it is respectfully submitted that all claims are in condition for 
allowance. Early action to that end is respectfully requested. The Examiner is hereby invited to 
contact the undersigned by telephone if there are any questions concerning this amendment or 
application. 




Respectfully submitted, 



Date: October 6, 2004 



Jas^fil. Garbell, Reg. No. 44,116 
Nefvozymes North America, Inc. 
500 Fifth Avenue, Suite 1600 
New York, NY 10110 
(212)840-0097 
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Random mutagenesis used to probe the structure and function of 
Bacalus stearothermophUus alpha-amylase 



LOsa Holm, Anu K.Koivula, PMivi M.Lehtovaara, 
Ari Hemminki and Jonathan K.C.Knowles 

Biotochnical Laboratory, VTT, Tielotie 2, SF-Q2150 Espoo. Finfand 

Mutations that cover the sequence of Bacillus stearothermo- 
phtlus a-amylase were produced by an efficient in vitro 
enzymatic random mutagenesis method and the mutant 
or-amylases were expressed in Escherichia coli, which also 
secreted the product. Ninety-eight mutants were identified by 
sequencing and their enzyme activities were classified into 
three classes: wild-type, reduced or null, A molecular model 
of the enzyme was constructed using the coordinates ofTaka- 
amylase A and a consensus alignment of mammalian, plant, 
and bacterial a-amylases. The location of mutant amino acids 
on the model indicate that mutations which destroy or 
decrease the catalytic activity are particularly clustered: 
(0 around the active site and along the substrate-binding 
groove and (ii) in the interface between the central al0 barrel 
and the C-terminal domain. Exposed loops are typically 
tolerant towards mutations. 

Key words: homology modelling/sequence alignment/starch 
Introduction 

a-Amylases (a-1 ,4-glucan-4-glucanohydroIase, EC 3.2.1.1) are 
widely distributed starch-degrading enzymes found in plants, 
animals and bacteria. Because of the importance of starch as a 
raw material in a number of industries, particularly the food 
industries, a-amylase is an enzyme of considerable commercial 
significance Despite this, there is relatively little information 
concerning either structure or function of this interesting family 
of enzymes. It is a common observation that even the availability 
of a good three-dimensional structure does not necessarily make 
it possible to predict the effects of specific amino acid changes 
even in the case where information about the enzyme mechanism 
is available (Knowles, 1988). For this reason we have developed 
a novel random mutagenesis method (Lehtovaara ex al., 1988). 
The method permits the enzymatic generation of a library of 
single (and multiple) base mutations throughout a defined region 
of DNA which can be several kilobases long. This method has 
now been applied to the cloned gene of Bacillus stearothermo- 
Ptotos a-amylase. A large number of mutations have been 
identified by sequencing in the course of refining the 
methodology. 

The three-dimensional structures of two a-amylases, one from 
Aspergillus oryzae, the so-called Taka-amylase A, and the other 
from pig pancreas, have been determined by X-ray 
crystallography (Matsuura era/., 1984; Buisson et al. , 1988). 
Both a-amylases have three domains: A, B and C. Overall, their 
structures appear very similar, but some differences are observed 
in the structure of domain 3 and in the orientation of domain 
C relative to domain A (Buisson et a/., 1988). The N-terminal 
domain A corresponds to the well-known (a/0) s barrel motif, 
which was first seen in triose phosphme isonierase (Plimips <t a/. . 
'* ln noddle of domain A, the chain loops out to make 
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domain B, which forms a lid above the barrel. The active she S 

is located at the top of the barrel on the C-terminal siTof tS 5 

^ lot ^* SubStnUC anal08Ues bind m site £ZZ 1 

acids around tins site which are conserved in all known a-arnS -3 

sequences. In B stearothermophUus a-amyiase they correspond % 

to residues D234, E264 and D331. Matsuura etlliSm « 

propose that residues E231 and D297 of Taka-amylase A t 

corresponding to D331 and E264 of B. stearothermophUus, are I 

tfie catalytic residues analogously to the mechanism inWyme I 

Buisson et at. (1988) propose that residues D196 and D300 of 3 
porcine pancreatic a-amylase, corresponding to D234 and D33i 

olBstearothermophilus, are catalytic. A binding she for calcium % 

which is essential for activity (Steer and Levitzky 1973) k i 
located between domains A and B. The C-terminal region is 
folded into an antiparallel /3-barrel and forms domain C AWhough 

it has proved possible to crystallize the enzyme from Bstearo- * 

bacterial a-amylases are not yet available. I 

It has been shown that tertiary structures are better conserved 1 

in evolution than amino acid sequences (Chothia and Lesk, 1986) 1 

Computer-aided molecular modelling has been usedmanumber n - 

of cases to construct a model of a protein of unknown structure Srs* 

which is related to a protein with a known structure [for a review 7<A 

see Blundell et al (1987)]. a-Amylase genes from a variety of Vf 

sources have been sequenced. Limited inter-species homology III" 

has been reported to be located in short stretches around the active J 

site (Svensson, 1988). Using a sensitive method, we have found (131 
sufficient homology over almost all of the amino acid sequence 

which justifies homology modelling of the B.stearothermophilus 3 

sequence onto the Taka-amylase A structure. It is known that ^ 

the three-dimensional structures of porcine pancreatic a-amylase ^> 
and Taka-amylase A are similar. We show that the sequence 

homology between B. stearothermophUus and Taka-amylase A % 

is more significant than between the former pair. The Taka- H: 

amylase A structure was chosen because it is currently the only M/ 

publicly available a-amylase structure. LiJ 

In this paper, we present a three-dimensional structural model CD 
of B.stearothermophilus a-amylase based on the structure of 
Taka-amylase A and show that the properties of the random 
mutants can be consistently interpreted with the aid of this model. 

Materials and methods 

Generation of point mutations 

The a-amylase gene from B. stearothermophUus ATCC 12980 
was cloned as a 1.9 kb long fragment to the phage vectors 
M13mpl8 (coding strand) and MI3mpl9 (non-coding strand). 
The insert contains the complete coding region of the a-amylase 
gene (1650 bp). Random enzymatic niutageiKSis of the clooed 
DNA was performed according to the method of Lehtovaara I 
etai (1988).Theniethodcanbesnnniar^ 
from oligonucleotide primers, a population of molecules is 
synthesized under conditions where one nucleotide is limiting. 
This gives a population of molecules each of which ends just 
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before the omitted base. After removal of all nucleotides, 
mishcorporation with reverse transcriptase is carried out under 
optimized conditions to give, in principle, a 1:1:1 ratio of the 
incorrect bases. Following synthesis of the second strand, the 
mutated dsDNA is transfected into Escherichia coli. The mutant 
library was generated in the course of developing the random 
mutagenesis method. Five oligonucleotides (20-mers) at -200 
nucleotide intervals were used to mutagenize the A nucleotides 
in the coding region of a-amylase in the M13mpl9 construction. 
With one of these primers the template nucleotides G and C were 
also mutagenized. Only one primer was used to mutagenize T 
and G nucleotides in the M13mpl8 construction (covering amino 
acids H208-L248). 

Characterization of mutants 

A suitable dut+ung* strain ofE.coli. such as JM109 or TG2, 
was transfected with the mutagenized DNA and the cells were 
plated on glucose minimal medium/H-top amylopectin azure 
plates. Expression of functional recombinant a-amylase in E.coti 
was detected as halos around the plaques. A three-grade scale 
was used to assay the a-amylase activity as follows: ++ tor 
wild-type-size halos, + for smaller but visible halos and - for 
non-detectable halos. Mutations were identified by DNA 
sequencing. 

Atomic coordinates for the starting model 
The atomic coordinates for the starting model were from the 
structure of Taka-amylase A at 3.0 A resolution [Matsuura et al. , 
1984; available from the Protein Data Bank (Bernstein et al 
1977) as entry 2TAA]. 

Sequence alignment 

Because of a high level of noise, automatic Needleman -Wunsch 
(1970) procedures are not applicable to the alignment of sequences 
that are only -20% identical, which is the case for most sequence 
pairs in our sample. The sensitive alignment program of Argos 
(1987) was used for initial pairwise alignments. Discrepancies 
between different pairwise alignments were resolved by visual 
pattern matching to produce a consensus alignment of the 
sequences. The positions of a helices and fi strands were taken 
for pig pancreatic a-amylase from Buisson et al. (1988)> and for 
Taka-amylase A they were calculated from the coordinates using 
the DSSP program (Kabsch and Sander, 1983). Helices and 
strands of pig pancreatic a-amylase and Taka-amylase A were 
matched. Gaps were introduced to maximize similarity across 
all sequences according to the following conservative substitution 
groups: |P, G), |S, TJ, {E, D, N, QJ, (H, Y, F, WJ, [A, I, L, 
V, M, C), |K, R). At positions corresponding to a known helix 
or strand, however, gaps were not allowed in any sequence. The 
significance of pairwise alignments was assessed by the correla- 
tion of five physical characteristics of amino acids (hydropho- 
bkity, buUriness, turn preference, strand preference, refractivity 
index) as given by Argos (1987). This measure proved quite 
sensitive to small changes such as the movement of just a single 
residue in a gap. Argos (1987) recommends that alignments 
where the average correlation is at >2.0 SD above random be 
considered seriously. Adjustments were made to the alignment 
until the value was about this threshold for the majority of 
pairwise alignments. 

Model building 

ft was assume*! that the backbone structure in aligned positions 
is the same between Taka-amylase A and B.stecrothermophilus 
a-amylase. Loops involving gaps were patched using fragments 
selected from the database of known structures (Jones and Thimp, 
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1986). One to three C(cr) atoms at both ends of the loop wete 
matched to the template (derived from Taka-amylase A). 
Different anchors were tried out until a fragment was found wheie 
the fit to C(a)s was good (deviation of superposed fragment to 
anchor atoms < 1 A). Ideally, ail main chain atoms of the 
fragment and of the starting structure would overlap, but in 
practice the overlap could be restricted to just one residue at both 
ends. Main chain coordinates were replaced by those of the loop 
starting from the second residue in the N-side anchor region and 
ending with the second-last residue in the C-side anchor region. 
Graphics was used to make sure that the selected loops packed 
tightly to the rest of the structure. Structures that would clash 
with the rest of the molecule were rejected already when they 
were collected from the database. Side chain coordinates for 
aligned identical residues were copied from the Taka-amylase 
A structure, otherwise they were inserted in standard con- 
formation. The conformation of F346 and W492 was changed 
to another staggered conformation because they initially clashed 
with the main chain. 

Strain introduced to the model through sidechain substitutions 
and joining of loops was relaxed by the program CHARMm 
(Brooks et al , 1983) using an adapted basis Newton -Raphson 
energy miriimization with a distance-dependent dielectric constant 
and an 8 A cut off for non-bonded interactions. The calcium ion 
was not included in the minimization. 

A chain of six glucose units joined by a-i,4-linkages was built 
in the interactive molecular modelling program Quanta (Polygen 
Inc.). Hie dihedral angles in the aldose linkage were +90 and 
-31° for 05-C1-04-C4 and C1-04-C4-C3 respectively. The 
interaction energy of a water probe to protein was calculated in 
Quanta. Energy countours at an arbitrarily chosen level of 
—2.0 kcal/mol formed a tube which passed by the catalytic 
residues, and the glucose chain was fitted manually to it. Rotation 
of one glycosidic bond wus necessary to fit the chain in contact 
with the catalytic residues (D234, E264, D331). The substrate 
was relaxed by energy minimization keeping the protein part 
frozen. 

Results 

Alignment of a-amylase sequences 

The alignment of distantly related o-amylase sequences is far 
from obvious because only six blocks of residues are clearly 
conserved in them (Svensson, 1988). However, when the 
correlation of physical properties of amino acids is used as a 
measure, an acceptable alignment in terms of standard deviations 
above random sequence comparison scores can be made over 
the whole length of rite sequences (Table I). The alignment in 
Figure 1 shows that the best conserved regions are in domains 
A and C, and these are segments around the catalytic residues 
and secondary structure elements where particularly hydrophobic 
patches are conserved. Although domain C lacks invariant 
residues, the patterns of alternating hydrophobic and hydrophilic 
residues are preserved in the 0 strands. 

In the aligned a-amylase sequences (Figure 1) there are 31 
positions where the nature of the amino acid is conserved. Of 
these, 13 positions require hydrophobic residues, three are 
invanamly glycine, three require an aromatic residue and seven 
require an invariant charged residue. 

In domain B there is very Utile amino ac id homology . Further, 
the arrangement of 0 strands in the two known structures, rjorcine 
pancreatic a-amylase and Taka-amylase A, is distinctly different 
(Buisson era/., 1988). In addition, in the amyloliquefying 
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^ . : pig pancreatic a-amylase and were therefore excluded from the 

alignment Identical amino acids in adjacent sequences are marked by a colon. A dot marks changes within the following groups: |P, GL IS. T| IEDN 
Ql, |H. Y, F, W|, |A, I, L, V. M. C|. ,'K, R|. Gaps are denoted by An asterisk below the aHgrnneni marks SIcS^ fa irf . 
plus denotes positions where variability is limited to two different groups, a Helices and 0 strands are underlined in the known structures. C^reomes^ond to 
surface loops in the structure of Taka-amylase A. Residue numbers for B^earothermophilus are shown above its sequence. 

Bacillus a-amylases, domain B contains a 43 -45 residues long 
insert relative to Taka-amylase A. For these reasons the proposed 
structure for domain B should be treated with considerable 
caution. 

Model building 

The alignment shown in Figure 1 was used to construct a three- 
dimensional model of B.stearothermophilus a-amy!ase using the 
structure of Taka-amylase . A as a template (Figure 2). Due to 
the lack of a suitable structural template, two segments have been 
omitted from the model: the 45-residue insert in domain B— the 
model joins residue 142 to residue 189— and the C-terminal 
extension of 19 residues (residues 497 -515). Without these two 
segments, construction of the structural models was relatively 
straightforward since the remaining shorter insertions and 
deletions involve only a few residues. Insertions or deletions also 
occur in places where the polypeptide chain folds back on itself 
so that loops of different lengths are easy to accommodate. Thus 
the structural changes made during modelling were minimized. 
One difficult region was between amino acids 444 and 456, in 
which a loop between two 0 strands in the Taka-?jnylase A 
structure is shortened by five residues. This deletion could only 
be incorporated by using a long extended loop which caused 
concomitant restructuring of this region by displacing one 0 
strand. As a result, domain C appears more flat and open in our 
model than in Taka-amylase A. 

Mutant library 

Point mutations distributed throughout the a-amylase gene were 
generated by an enzy matic misincorporation method (Lehtovaara 
et al. , 1988). In the work presented here, mutants were identified 
by sequencing and subsequently assayed on starch plates for 
enzyme activity. In this sample of 98 phenotypic mutants, there 
are 31 which show no activity, 28 with decreased activity and 
39 with wild-type enzymatic activity (Table II). Seventy-five 
different residues are hit by at least one phenotypic mutation. 




fig. 2. Structural model of B.stearothermophilus a-amylase based on die 
known structure of Taka-amylase A. Left: Taka-amylase A. right: model of 
& stearothermophilus or-amyiase. Only the N, C(a) and C atoms of the 
main chain are shown. The model was constructed based on ^fimrr 
homology to Taka-amylase A. the structure of which has been determined 
by X-ray crystallography. A 45-residue insert from the back of domain B 
and a 17-residue extension to the C terminus have been ornittod. The light 
blue sphere shows the location of the essential calcium ton a* found in 
Taka-amylase A. Red. green and yellow spheres mark the probable catalytic 
residues. Loops 17-38 and 333-350 of Taka-amylase A and the 
corresponding loops 15—19 and 370-374 of B. stearofhermophitus 
a-amylase are coloured yellow. Domain B is coloured orange and domain C 
green in B. steartxhermophiUts a-amylase. Domain A (blue) is an (a'ftg 
barrel. 



and Figure 3 shows that they are distributed over most of the 
protein. Ousters of residues at which mutations affect activity 
are seen around the active site, and at die interface between 
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Table IL Mutants generated by (he random mutagenesis method 








Activity 


Observed amino acids 


(i> Mutations to stop codons 








R126U 


E12%top(S131S) 








ElMstop 










K141stop 










KlSSstop 
K257atop 










Y439F 


K442stop 








(ii) Matatioi 


is in the active she 








R23ZW 








232: R 


D234G 








234: D 


H2WN 






+ 


238: HG 


(V236V) 


H238Y 








H238Y 






- 




(L259L) 


T261A 




+ + 




T26IS 


E264V 








E264D 


Y265S 








Y265S 






+ T 




Y265F 






+ 




D331A 








331: D 


TO2P 


E333D 






332; TNM 


E333A 






+ + 


333: EQYP gap 


(iii) Mutatta 


ns in the substrate-binding groove 










I//or 




+ + 


IS: YERSK 


\rtrc 


UloA 






18: YDNQ gap 




nllM/ i\iMf iniP 

£?/or D19V T21S 


L22F 


+ 


19: DSC gap 


vice 

Mi JO 


Dior D 19V T2 1 P 


L22F 


+ 


21: TQG gap 


« / JO 


UloA LJIVV IZIr 


L22F 


+ 


22: LHWSTD gap 


uiov 


ntov 
DI9V 




+ 




MJMOV 


mo a 




+ 




t\SOA 

UIcA 


U19V T21S 




+ + 




D18A 


D19V T21P L22F 




+ 




SS1R 






— 


5I:SPEYYQIA 


D54V 


(V55V) W7C 




— 


53: SAVDNTGH 


DS4A 


(V55V) K57S (G58G) 






54: DCTIQN5 gap 


D54V 


(V55V) K57S (G58G) 






57: RKY gap 


Y60F 








60: YGW 


Y63C 






+ 


61: DRTHWE 


K237R 






+ + 


oj: T 


P334S 










Q336L 






+ 


jJo: v I UMiA gap 


Q336H 






+ + 


Q336P 










Y369S 






+ 


369: YMFfcPR 


Y3T0S 






+ 


3 Ah YrQEWR 


(iv) Mutations in or around the interface between domains A and C 








D343A 






+ 


343: DQPNGS 


K347N 






+ 


347: KVURY 


V351S 






+ 


351: Y1VQWN 


DSO, 








354: IFVAM 


R413G 








413: RDCQ 


R413W 


E414A 




_ 


414: EDKRGS 


R413W 


E414V 




- 




E414V 






+ + 




E4I4D 








417: TS gap 


T417P 


E4I8A 




+ 


418: ES gap 


(A426A) 


I428F 






428: ILFA gap 


S435F 








435: SAYLGW 


S4S5C 






+ 


436: KDSQAFT 


(v) Mntanoi 


b in other buried sites (solvent-accessible surface of wOd-tvpe residue <50 A 2 ) 






E13V 






+ + 




D19V 


T21S 




+ + 
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Table U (continued). 



Activity 



Observed 



DI9V 


T21? 


E29D 




N32I 


U3F 


L»F 




L33F 


S35C 


K48T 




EI29G 




Q254P 




I277F 




L291F 




G336C 




G335C 


A337T 


A337T 





(t22L) 
S35C 



T24P 



+ 
+ 

+ + 

+ + 
+ + 
+ 



(vi) Mutations in other exposed sites (solvent-accessible surface of wild-type residue >50 A 2 



L32F 


+ + 






N31Y 


+ 


NU7T 




TI33S Y134F 




YI34S 


+ + 


Y134F 




^I35L 


+ + 


QI37P 




A138T 


+ + 


N149I 


+ + 


H208L 


+ + 


T2I3A 


+ + 


K216E 


+ + 


S217G 


+ + 


S217C 




K220R 


+ + 


T225P 


+ 


Y250F 


+ 


T255P 




K257T 


+ + 


K257Q 


+ + 


K257M 


+ + 


K419N 


+ + 


K419T 




Y4S9F K442Q 


+ 


K442N 




Q443H 


+ + 



29: EDKA 
32: NYHDMC 
33: UGKTE 
35: SEDAGFIYQ 



291: LTDRS 
335: GFN ga 
337: ASQE 



25: KRMG gap 
31:NAED 
127: NLQTWBF 



225: TEDNMI gap 
250: YHAV5T 
255:TSAgap 



419: KVA gap 
439: YAQVSQLE 
442: KRtSDNQA g 



The wild-type amino acid is given on the left and the annuo acid it is mutated to is given on the right of the residue number. Deleterious 
printed in bold. Note mat some of the mutants have multiple substitutions; comparisons with corresponding single mutants sornemi 
change. In clusters (like the one around residue 20). where the casuatrve mutations are less dear, the primary suspects arc printed 
oroer f™***" 15 **** **** interpreted as being neutral. Silent mutations seen only at the DNA level are in parentheses. Enzyme 
were estimated on starch plates as compared to the wild-type done: + + wild-type activity, + less than wild-type but greaser tfa 
done showed drastically elevated activity. Mutants are listed grouped accmdiug to their location in the strucuual model The 
was calculated using the program DSSP (Kabsch and Sander. 1983) from the model of wOcMypc B.stearuthermopkiba o-amylase. „ 
208 -^00 belong to domain A. residues 105-207 to domain B and residues 401 -515 to domain C. For convenience of cornparison 
sequences, the right-most column lists for + and - mutants which amino acids are observed in natural crstmylases at given positM 
also includes five tnaimnalian seouences (P1R codes ALHUS, ALHUP, ALMSS, ALMSP, ALRTP) which are tugtdy homologous 
and which were excluded from Figure I. 




domains A and C. The relationship between the effects of 
different mutations and their location on the model is discussed 
in more detail below. 

Active site 

There are many conserved and buried charged residues around 
the active she. D331 is relatively accessible to solvent but the 
other two proposed catalytic residues, D234 and £264, are close 
neighbours and appear almost buried at the bottom of a pocket. 
The most striking mutation in this region is the chemically 



rtornologous substitution E264D, which is inactive (in a double 
mutant with Y265S, which as a single mutation has wild-type 
activity). This strongly suggests that E264 is essential for 
catalysis. In addition the mutation D331A on one hand and the 
mutant D234G on the other are inactive, suggesting that the 
carboxyl groups of all three, D234, E264 and D33 1 , pby a rale 
in the enzymatic mechanism. In the energy minimized model, 
R232 is in the centre of a artwork of (rydrogen bonis aod s^t 
bridges, which may fix the catalytic residues in the correct 
orientation for catar^ D10KH33O 

187 



LMahaetaL 




and E264 and further to W42. D33I, N329 and D288. All of 
these residues are highly conserved. In other a-amylases W4"> 

^ ^.Z? ^ by Q ' N329 b * S W 0288 «V N or E or A 
™i L . mentioned are invariant (Figure I). The 

mutants R232G and R232W are inactive, thus demoStrating the 
functional importance of this conserved arginine residue 

Subsrrate-binaing groove 

t ^ f r de ' d ? ere ■ ? pearly visible groove winding across the 
top of the a/0 band in domain A, into which a six-unit amylose 



chain was docked. Because D234 and E264 are at the bottom 
of a deep pocket, the long substrate chain could only be brought 
into contact with them by introducing a sharp bend in the chain. 
The glucose units make extensive contacts with a number of 
residues equivalent to those proposed for Taka-amylase A 
(Matsuura etai, 1984) and porcine pancreatic a-amylase 
(Buisson et ai, 1988). Those residue* that are in contact with 
the substrate at the subsites adjacent to the hydrolyzed bond are 
highly conserved in all a-amylases, but more variation is allowed 
at the other subsites. 
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In Taka-amylase A the regkm of the substrate-bimling groove 
furthest from the catalytic she is Mocked by the adjacent 
protruding loops 17-38 and 333-350 (residue numbers 
Riding to the Taka-amylase A sequence). They correspond 
to the much shorter loops 15-19 and 370-374 in B.stearo- 
thermophilus a-amylase (Figure 2). We speculate that the 
'missing* subsites might explain the fact that Taka-amylase A 
is more saccharifying than Bacillus a-amylases (Priest, 1984) 
by biasing the substrate specificity of Taka-amylase A towards 
residues at die end of the amylase chain. 

In the model of B.stearothermophilus a-amylase there are 
notably many tyrosine side chains lining the substrate-binding 
groove. Severn mutations, single or multiple, in these residues 
(Y15, Y57, Y60, Y63, Y369, Y370) lead to decreased activity 
(Table D). From its location and mutant data (Table II), Q336 
is also implied in substrate binding in B.stearomermophilus 
a-amylase, though it has no counterpart in Taka-amylase A for 
example (Figure I). W266 is in contact with the docked substrate 
It corresponds to tryptophan 206 in die barley a-amylase T which 
has been pinpointed as essential for enzymic function in 
biochemical mapping (Gibson and Svensson, 1987). interestingly 
in some owimylases this position is occupied by hydrophobic 
residues other than tryptophan. 

Interface between domains A and C 

A second major cluster of inactivating mutations is found in and 
around the interface between domains A and C, -30 A away 
from the catalytic site (Table U and Figure 3). In low solution 
X-ray studies of porcine pancreatic a-amylase a second binding 
site for substrate analogues has been identified near the 
N-tenninus of the eighth helix in domain A that is facing domain 
C (Payan el al. y 1980). The function of this site is unknown 
but it has been implied that it may play a role either in anchoring 
the enzyme to starch or in the regulation of activity (Buisson 
et a/., 1981). In our model both salt bridges and hydrophobic 
contacts are found across the interface between domains A and 
C The latter interactions are affected in the inactive mutant 
I428F, which is located on a 0 strand in domain C, and in the 
mutant B54L, which is on the seventh a helix in domain A and 
has much decreased activity although this amino acid substitution 
is conservative. The dramatic effects of such Innocent-looking' 
mutations seem to imply that the exact orientation of domain C 
to domain A is crucial for activity. In the model of B.stearo- 
^rmophilus a-amylase these two residues are opposite to each 
other- Many combinations of hydrophobic residues are found at 
these positions in the other a-amylases (Table n, Figure 1). 
Since, particularly in domain C, many gaps were introduced into 
the alignment (Figure I ), the structural details of domain C and 
Us contacts with domain A are obviously different in the different 
enzymes. This is clearly illustrated by comparing the known 
stnictures of Taka-amylase A and porcine pancreatic a-amylase 
(Buisson a/., 1988). That domain C is important for activity 
is also shown by the effect of a mutation that causes a premature 
stop codon at residue 442. This removes roughly twcHhirds of 
aomain c leads to inactivation of the enzyme. 
Calcium-binding site 

The stnicture of the calcium binding she between domains A and 
p is similar m the two known structures, and the in variant D203 
££f ^1 U&mds oftneca, ™m wn (Buisson etal.. 1988). 
DIQ5 and the maL . '-am carbonyl oxygen from the invariant 
H238 are other hgands. D105 is replaced by an asparagine in 
the other a-amylases (Figure 1). The importance of H-bonds 
involving the stdechain of H238 is suggested by the mutations 



H238N, which is still functional, and H238Y f which is inactive 
(Table II). 

Buned sites are typically more sensitive to mutation man 
exposed sites 

The great majority of amino acid replacemems wfiididonot much 
affect the activity of the enzyme are located at exposed sites in 
die 3-D model. Mutations affecting the active site region and the 
A-C mterdomain region which inactivate a-amylase were 
described above. Several other examples, where even conserva- 
tive mutations are deksericus, were found at buried sites For 
example, the inactive mutamL33F and the mutant L291F which 
has reduced activity, are located in the interior ofdornain A 
Interestingly, the substitution corresponding to L29IF is found 
in B.circulans a-amylase. An examination of the structure of 
Taka-amylase A and models of B.circulans and B.stearvmermo- 
pmlus a-amylase indicated that Fhe can te accommodated a* tfm 
position in the core of B.circulans a-amylase due to a number 
of compensating substitutions to smaller residues in its 
neighbourhood, but it could not be fitted to the other two 
stroctsres without steric clashes. 

E29D, located at a partially buried site, has reduced activity. 
In the three^limensional model E29 forms a salt bridge to K25. 
This seems to be an import 

philus enzyme, since die mutant K25N is inactive. The other 
liquefying Bacillus enzymes have D at position 29, but the 
reduction in size of this residue is compensated by Rat position 
25. No corresponding salt bridge is present in the structure of 
Taka-amylase A. 

At certain sites proline may be accommodated without effects 
on the activity, and at other sites it cannot be accommodated 
(Table II). For example, die mutations Q254P, leading to fully 
active enzyme, and T255P, leading to inactive enzyme, are in 
neighbouring residues in a surface loop. 
Interactive effects in multiple mutations 
Table D shows that generally the effects of amino acid substi- 
tutions are additive. There are, however, a ninnber of eHsptions. 
For example, G335C alone gives an inactive enzyme, but activity 
is recovered if A337T is also present. A337T alone has no effect 
on activity. Other interesting examples include the clusters of 
mutations around residues D18 and L33. 



We describe here the construction of a molecular model of 

B.stearothermophilus a-amylase using a novel alignfnrTtt of 

amino acid sequences from many distantly related a-amylases 

and the available information on secondary/tertiary structures. 

We also present the phenotype and spatial location of 98 separate 

randomly generated amino ackl mma^ 

mutations are clustered in the model, revealing functionally 

important regions. Thus, the model provides a rationale for 

interpretation of the mutation^ 

provides evidence supporting the model. 

Porcine pancreatic a-amyiase and Taka-amylase A have only 
23% identical amino acids (Table I) but very similar three- 
dimensional stnictures. We have assumed that all a-amylases, 
which have typically -20% identical residues, dare this 
common structural design. There are several well known 
examples of protein tami^ 

like serine proteases, where die seqiiencescaa diverge even more 
but the three-dimensional structures are well conserved. The 
correct alignment of the amino acid sequences to identity residues 
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occupying topologically equivalent positions is critical to the 
success of homology rooddling (Read a/. f 1984). Argos (1987) 
has described a cautionary example concerning die pairwise 
alignment of lactate and nialate dehyd/ ogei t ase : dm sequence 
alignment optimal in terms of amino acid identity does not 
coincide with die alignment derived from the optimal super- 
position of the tertiary structures. However, the alignment of 
a-amylases described here is supported by structural information 
concerning calcium ligands and secondary structure. In addition, 
amino acids of similar physical properties are well aligned. At 
least 70% of residues in all sequence pairs are aligned in Figure 1 
and, statistically, the alignment is significant (Table I). It is 
generally accepted that if the sequence of a protein with unknown 
tertiary structure can be related to one with known architecture, 
then die main-chain folding of the latter provides an «Ayp«ate 
base for structural modelling (Blundell et a!., 1987). However, 
in the course of evolution, mutations are accommodated in protein 
structures through small shifts in die relative position of secondary 
structure elements (Chothia and Lesk, 1982a). Such effects cannot 
be accounted for in modelling procedures which use a rigid 
framework. Refinement by energy minimization results only in 
small shifts in relative atomic positions, typically in the order 
of 1 A at most. Thus, it is clear that this model represents only 
one possible low-energy conformation of the protein. Despite 
these reservations, we were pleased to note that the a-amylase 
model is consistent with biochemical data obtained from random 



For this work, it was more important to gather information 
on a large number of mutants than to analyze a few in detail. 
Mutants generated by random mutagenesis were identified by 
sequencing, since it was important to understand what types of 
base changes the mis incorporation procedure produced 
(Lehtovaara et al y 1988). Around 40% of phenotypic mutants 
in the sample described here retained wild-type activity. The 
remainder had reduced activity or were inactive. 

The loss of enzyme activity observed in the plate assay could 
be assayed by: (i) impaired function of secreted enzyme or (ii) a 
reduction in the amount of enzyme due to destabilization of the 
secreted protein or impaired folding and even degradation inside 
E-coti. Mutational analysis of a number of proteins has shown 
that amino acids which are critical to protein stability are generally 
relatively rigid and inaccessible to solvent in the folded protein 
whilst substitutions at mobile and exposed sites usually have little 
effect on protein stability (reviewed by Alber, 1989). Exceptions 
are made by the introduction of prolines or replacement of 
conformationally special glycines, which can propagate shifts in 
the folded structure even if the mutation is at an exposed site 
(Alber, 1989). Our interpretations concerning the effects of the 
mutations are based on the location and interactions of the mutated 
residues in the structural model. Mutations which apparently 
impair function are clustered together in three-dimensional space 
at two surface sites: die active she deft and the domain 
A -domain C interface. In the substrate-binding groove, a 
number of exposed residues, which may form hydrogen bonds 
to the substrate, were shown to be sensitive to mutation. The 
catalytic center and calcium-binding site are also identified from 
the known X-ray structures and by sequence conservation. Seven 
absdu^ conserved charged residues in the family of ff-amylases 
are located here. Four of these residues were mutated in our 
sam^e and led to inactive enzyme in each case . Apart from the 
dusters of deleterious mutations mentioned above, exposed 
residues were found to be less sensitive to mutations than buried 
ones. A number of mutations to proline and mutations substituting 
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a larger sidechain at buried sites were apparently deleterious 
because they disrupt the tertiary structure. 

The pH dependence of a-amylase suggests a lysozyme-type 
mechanism employing two carboxy lie ac ids , one ionized and the 
other protonated in the active form. The ad-<jcent E264 and D234 
are buried, and therefore may have elevated pJ^s, whereas 
D331 is more exposed and is expected to have a normal pK^. 
Matsuura et al. (1984) have argued that in Taka-amylase A two 
carboxyl groups in different environments are required for activity 
and therefore the residue corresponding by homology to D331 
in B. stearothermophilus a-amylase is a catalytic residue. The 
proposals by Matsuura et al (1984) and Buisson et al (1988) 
agree on the role of D33I as a catalytic residue in a-amylases 
but differ with respect to the roles of the neighbouring residues 
E264 and D234. Buisson et al. (1988) reason that D234, which 
has more direct interactions with the cairium-binrimg site , kin»»iy 
to be the other catalytic residue. However, our mutant data very 
strongly suggest that E264 is a catalytic residue. 

Domain C is shown here to be indispensable for activity since 
a cluster of mutations far from the active site reduce enzyme 
activity. Subtle conformational changes at the interface between 
domains A and C, which we assume accompany the mutations 
I428F and I354L, seem to have a large effect on activity. It has 
been proposed for mammalian a-amylases that a she at die 
domain A -domain C interface might have a regulairry role in 
that substrate analogues bind to this region and affect activity 
(Payan et a/., 1980; Buisson et aL 9 1981). It is not clear why 
a bacterial exoenzyme should require this type of regulation. We 
therefore propose that domain C plays an important role in starch 
hydrolysis by orientating the active site cleft of domain A 
correctly with respect to the amylose chain. 

In conclusion, we suggest that this approach of integrating 
computer-aided molecular modelling to the rationalization of 
random mutagenesis data can contribute substantially to an 
understanding of protein structure and function, even in cases 
like this where the crystal structure of that protein is not available. 
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conserved residues. mKSTJST 2 * I si ^dlre^ mut^enesis of the three most 
reduced the specific acuX Ld °* aad binding 

ture optima. SS^mS3^£ SiTv r? t i^' bUt did DOt affect P H and *«P«- 
tion. &teres^^^ lnac « va - 
product pr^by^S^^S^ ^ * enZyme showed * tered 

conserved Arg^o, WteSKfal^ ^^1°^ ^^ve mutation of the 
specific activity: ar^ 

range but about 0^1^171 enzyme. This mutant enzyme had a wider dH 

molecular modeling. different mutants were interpreted by ^pute* aided 



zn^lTV*^ ^ pr0duced b * mic ™***. Plants, and 
aimnals to hydrolyze starch and other linear and branched 
M-glucosidic ohgo- and polysaccharides. Almost all 
amylases studied (for a review see Ref. J) have a few 
?™r ed A re ^ ons < 2 ' 5) with approximately the same 
length and spacing. One of these regions resembles the Ca' ♦ 
binding site of calmodulin and the others are thought to be 
^IT^^ 2 r ^ e « Ctive ^ enter bindin e the substrate 

1 .// U K refine rhraB-Himfn cp^ struchirP nf t\* A* r » r 

*5E&^w^ Sug « ested to bin d ^ven glucose 
^ ldue8t ° on the surface of the protein £done~ 

I ? ut *? tes ' containing amino acids Glu-230 and 

^;; 7 » w . aS P T Qpo ? d to * the catalytic center (4). 
^cent structural studies on porcine pancreatic a -amylase 

fi^JS^J^^-^ » mon! ^ to * ^lved 
m sutetrote bmding (5). The catalytic amino acids of TAA 
are ttiUs Asp 206 and Asp-297 (5),TKe calcium ion, found 
£ virtuall y all or -amylases and required for stability ( 1). is 
bound to four amino acids in TAA and PPA (5) 

We have used the cloned, thermostable Bacillus steor- 
othermophilus amylase (BStA) in studies concerning 
expression and secretion (6-9). Here we studied the role of 
the three most conserved amino acids in maintaining the 

?2s oTvzrsr ° y Koundation - *• «-* 

' 'Z^T^ 6 ^^ 6hould * M. Vihinen, Depart- 

ment of Biochemistry, University of Turku, SF-20500 Turku 
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structure, activity, and stability of the thermostable 
* -amylase.,The structure of the enzyme was modeled from 
the known structure of TAA, and this model was used to 
interpret the results obtained with mutated enzymes 
Conserved residues Arg-232, His-238, and Asp-331 were 
all shown to have important functions. 

MATERIALS AND METHODS 

Strains, Culture Conditions, and Assays-Strains Es- 
cherichia coli JM103 (10); MC1061 (11); R21032 (12) 
BaciUus subtUis 1A40 (Bacillus Genetics Stock Center* 
Ohio), and phages Ml3mpl8 and mpl9 (13) have been 
described L broth, 2xTY, and M9 mediums were sup. 
plemented with 100^g/ml ampicillin or 10,/g/ml kana- 
?l y iS? W necessary Plasmid pCSS6 (7) and p602/ 
I5HT have been described, a- Amylase activity was mea- 
sured as described (7) and protein concentration was 
determined according to Lowry (15), 

DNA Techniques and In Vitro Mutagenesis-lsohtion of 
plasmids and single-stranded DNA sequencing was accord- 
ing to Maniatis et al (14) and double stranded sequencing 
was according to Zagursky et al (16). Competent E. coli 
(17) and B. subtilis (18) cells were transformed as de- 
scnbed. 

Oligonucleotide directed mutagenesis was performed by 
the method of Kunkel (19). Template DNA used for 
mutagenesis was a 539 bp Kpnl-Pstl fragment of the B 
stearothermophilus a-amylase gene cloned into the corre- 
sponding sites of Ml3mpl8 or m P 19. The oligonucleotides 
were hybridized with the uridinylated templates extracted 
from E. coli RZ1032. Plaque lift hybridization was used to 
screen for the mutants of E. coli JM103 and the positive 
candidates were analyzed by sequencing. To reconstruct 
the ^-amylase gene the mutated Kpnl-Pstl fragment was 
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subcloned in pCSS6 and transformed in E. coli JM103. The 
mutated a-.amylase genes in pCSS6 were cut with Dral 
m **Z. G **t wer « filled in. Then the genes were digested 
with ««n«ni for cloning in B. subtilis with p602/15HT 
which had been cleaved by BamHl and HindlU after the 
ends had been filled in. To clone the mutant Arg-232-Lys 
V?J? Ft* *- am y las « Bene had to be cloned in p602/ 
15HT and the mutated Kpnl-Pstl fragment ligated to this 
construction because the mutation generated a newrecogni- 
tion site for tfzndlll. After ligation E. coU MC1061 cells 
were transformed and the extracted shuttle plasmids were 
introduced into B. subtilis 1A40. 

Purification and Characterization of a-Amylase-The. 
wild-type and mutant enzymes were purified from B 
subhhs clones. The culture fluid was heat treated (75'C 1 
h), after which the enzyme in supernatant was absorbed'to 
insoluble starch (1-5%) in 10% (NrL),SO,. Impurities 

ES? ^° Ve l by washing ^ 10 Tri 5 (PH 7.5), 10% 
w i^f 0<; the en2 y m e was liberated with 4 M urea 10 

^ ( ^ 7 ^ md J « ain8t 10 ™M Tris (pH 

7.5). 10 mM CaCl 2 was included throughout the purification 
procedure. The incubation time in temperature and pH 
optimum measurements was 15 min. For the pH optimum 
determination aliquots of enzymes were incubated at 70'C 
with substrates buffered to various pH values The pH 
electrode used was a Radiometer GK2401C (Radiometer 
A/b. Copenhagen, Denmark). Complete hydrolysis of 
starch was performed overnight at 50'C in 10 mM Tris (pH 
6.8), 2 mM CaCl„ and 2% com starch. Remaining starch 
was removed by centrifugation and samples of the super- 
natents were analyzed on HPLC in a //Bondaback carbo- 
hydrate column. 

Labeling and Immunoprecipitation Conditions -All the 
growth media were supplemented with kanamycin Over- 
night cultures of B. subtilis in 2xTY were used to inoculate 
bMS medium (20). The cells were incubated atarCto^ 
0.5-0,6 and a 1 ml portion was centrifuged and the cells 
resuspended in SMS medium lacking methionine. After 30 
mm [ Sjmethionine (100 A Ci) was added and the cells 
were allowed to grow for 2 h. The culture was centrifuged 



TAA 117 DWANH 122 

BStA 101 DWFDH 106 

BAA 98 DWLKH 103 

iv» BLA 100 DWINH 105 



II 

202 GLRIDTVKH 210 
230 GFRlQAVXH 238 
227 GFRIDAAKH 235 
227 GFRLOAVKH 235 
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to separatecells. The supernatant was saved (extracellular 
"f k were washed twice **h 10 mM Tris 
?2 ^V, 5 ^ NaCl ' mi ««Pended in 10 mM Tris (5? 
7.3), 1 mM EDTA, and then disrupted by sonicatS The 
suspension was centrifuged at 13 ,000 Xg for 30 min and the 
supernatant was saved (cytoplasmic fraction). The im 
munopreapitation was performed according to Minsky* 

Sequence Comparison and Computer Modeling-Se 
quence analyses were performed according to Smith anH 
Waterman (22) and Needleman and WuriscS f using 
GCG software (24) and the multiple sequence comparison 
was according to Vihinen (25). The structural models for 
the deduced ammo acid sequences of <r -amylases from B 
sUarothermophilus (6), Bacillus licheniformis (BLA) (26) 
and Bacillus amyloliquefaciens (BAA) (27) were built by 
computer modeling on an Evans & Sutherland PS330 
graphics terminal. Computer programs employed were 
Frodo (28), Hydra (Polygen, WahLm, M^ ) Insist 
fflbSZ . Ted 7 l0 8i e8 > San Diego. Calif.), and Gromos 
(biomos b.v., Gromngen, The Netherlands). The three- 
dimensional structure of TAA (4) was obtained from the 
Brookhaven Data Bank. Replacements, insertions and 
deletions were introduced to the TAA structure and the 
models were refined by energy minimization (29). 

RESULTS 

Sequence ABgiysts-Numerous ^-amylase' sequences 
from widely differing genera have been reported and the 
refined structure of the TAA at 3 A resolution is available 
beveral methods have been employed to compare and align 
sequences. Recently programs to handle simultaneously 
several sequences have become available, and the compari- 
sons have been extended to predicted secondary structures 
V dg ^ mpanson md alignmentof the amino acid sequence 
of BStA with that of plant, mammalian, and microbial 
or-amylases were performed with several computer pro- 
grams. The results compiled in Fig. 1 show the existence of 
the four conserved regions in TAA, BStA, BAA, and BLA 
sequences as reported previously (2, 3). The overall 
augnment of the sequences is d ifferent from that pr^t^ 
by Makaiima et al (30) . ' 




III 

TAA 229 GEVLD 233 
BStA 263 <$YWS 267 
BAA 260 AEYWQ 264 
BLA 260 AEYWQ 264 



IV 

292 FVENHD 297 
326 FVDNH^ 331 
323 FVENHD 328 
323 FVDKHD 328 
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Fig. 1. The four conserved regions of A. oryzae (TAA), R 
sUarothermophilus (BStA), B. amyloliquefaciens (BAA), and 
B. licheniformis (BLA) <x-amylascs. 



Fig. 2. Synthesis of native 
(wt) and mutant Asp-331-GIu 
cr-amylases in B. subtilis. Cells 
were Labeled with ("SJmeth- 
ionine, centrifuged, and sonicat- 
ed. The culture fluid (Ext) and 
disrupted cells (Cyt) were ana- 
lyzed by 10-15% SDS gel electro- 
phoresis. The enzyme and the 
fraction are listed above each lane. 
The arrow indicates position of 
or -amylase. 



«/. Biochcm. 



Site-Directed Mutagenesis of a Thermostable a* Amylase 



269 



Fig. 3. Thermostability 
of native (on the left), 
mutant His-238-Asp (in 
the middle), and mutant 
Arg-232-Lys (on the 
right) enzymes produced 
' in B. subtil is. Aliquots of 
enzymes were incubated at 
70*C <A), 80*C (0), and 
90*C (O), and samples 
were withdrawn at the in- 
dicated times for residual 
activity analysis. 




120 ISO 160 



TIME (min) 



TABLE I.; Short chain end-products liberated from soluble starch by the native and mutant a-amylases. The end-products are 
denoted by Gl to G6 to show linear oligosaccharides from glucose to maltohexaose. Total activity indicates the total amount of short chain 



Enzyme 






Percentage of end products 






Total 


Gl 


G2 


G3 


G4 


G5 


G6 


activity 


Wild type 


5.7 


21.9 


26.7 . 


10.3 


21.9 




100.0 


Arg-232-Lys 


1.9 


14.5 


19.0 


6.8 


31.1 


18.6 


58.0 


His-238-Asp 


22.4 


21.5 


10.5 


— 5.9 


24.3 




99.3 


Asp-331-Glu 




46.5 


53.5 








15.5 




Fig. 4. Activity of the native (o), mutant His -238 -Asp (o) f 
and mutant Arg-232-Lys (a) relative to pH at 70*0. 



Characterization of Mutant Proteins — The mutagenesis 
strategy used was that described by Kunkel (19). The 
following mutations were constructed to test the functions 
of the three most conserved amino acids that were iden- 
tified by sequence comparison: Arg-232-Lys. His-238-Asp 
and Asp-331-Glu. The mutant proteins were purified from 
B. subklis and their enzymatic properties were compared. 
Immunoprecipitation of the [''S] methionine labeled cell 
fractions and culture fluid by anti-ar -amylase indicated that 
B. subtilis cells containing the wild-type and the mutant 
Asp-331-Glu or-amylases contained approximately the 
same amount of precipitating material (Fig. 2). It can be 
suggested that these proteins are produced at similar rate 
because the constructions were similar. 

Enzymatic Properties of Mutants — Temperature optima 
of the wild-type and mutant proteins were found to be 
identical, at 80*C (data not shown). The mutant Arg-232- 
Lys retained almost the same heat stability as the wild- 



type a -amylase whereas the mutant His-238-Asp was 
inactivated remarkably faster (Pig. 3). The pH optima of 
the parental and mutated proteins are at pH 5.5 (Fig. 4). 
However, the mutant Arg-232-Lys has a much broader 
optimal pH for activity, from pH4.5 to 7.0, than the 
wild-type enzyme, which has optimal activity from pH 4.5 
to 5.5. The pH optimum of the mutant His-238- Asp shifted 
slightly towards neutrality. 

The mutant Arg-232-Lys had only 12% and the mutant 
His-238-Asp 42% of the specific activity of the parental 
enzyme. The mutant Asp-331-Glu has almost no activity. 
The effect of the mutations on the extent of the hydrolysis 
reaction was tested by analyzing liberated short-chain 
oligosaccharides. The end-product profile of the mutant 
His-238-Asp was clearly different from that of the wild- 
type enzyme and shifted towards shorter end-products 
(Table I). The mutant enzyme liberated four times more 
glucose than the parental enzyme. Also the mutant Arg- 
232-Lys had a slightly altered end-product profile. It gave 
a higher proportion of longer oligosaccharides, malto- 
pentaose, and maltohexaose. End-product analysis showed 
that the mutant Asp-331-Glu also exhibited some anxio- 
lytic activity, the only short-chain products being maltose 
and maltotriose. The yield of oligosaccharides was only 
15.5% of that of the wild-type enzyme. The mutant His- 
238- Asp liberated similar amount of short-chain sac- 
charides as the wild-type enzyme and the mutant Arg-232- 
Lys about half of that The specific activities and total 
activities in Table I cannot be compared, since the total 
activity is given at end point. 

The effect of soluble starch concentration on the activity 
was examined at 70*C and pH 5.5. The Michaelis constant 
is 4.5, 5.3, and 6.2 mg/ml for the wild type, Arg-232-Lys 
and His-238- Asp mutants, respectively. 

Modeling of a- Amylase S(ractares^The three-dimen- 
sional structures of BStA, BAA, and BLA were modeled 
based on the TAA structure in an attempt to understand the 
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relation between the altered properties of the mutant 
proteins and their structure. The entire structure of the 
Bacillus a -amylases could not be modeled from TAA 
because they all have an insertion of about 45 amino acids 
near the active center and the Ca~* binding site. The 




Fig. 5. Computer model of BStA catalytic residues and His- 
238. The model shows the function of His-238 (in yellow) in substrate 
binding and Ca" binding. Ca=- is shown in the upper left corner with 
magenta dots. The surface of the active site. Asp-234 (upper) and 
Asp-331 (lower), is indicated by blue dots. Maltotriose as substrate is 
in red. The glucosidic bond to be broken is in white. 



insertion is unique to liquefying Bacillus ^-amylases; it 
cannot be found in any other a -amylase sequence pub- 
lished. This region could perhaps participate in substrate 
binding. The B. amybliquefacicns and the B. licheniformis 
a -amylases were modeled to aid in evaluating the validity 
of the B. stearotkermophilus model, since homology be- 
tween BStA and TAA is not more than about 40%. The 
Bacillus ^-amylases are more similar: homology is 60- 
70%. If independent models of quite homologous a- 
amylases are essentially similar, the validity of the models 
is supported. The active site regions are very similar in all 
three models (29). The overall structure of PPA is similar 
to that of TAA. Unfortunately only the coordinates of TAA 
were available. 

DISCUSSION 

The importance of three conserved residues in thermosta- 
ble B. stearothermophilus a- amylase was studied by site 
directed mutagenesis. Computer methods were used to 
explain the results. Even when the protein structure has 
not been determined, as was the case in this work, com- 
puter modeling can be useful and suggest hypotheses which 
can further be tested by mutagenesis studies. 

One of the four conserved regions in a-amylases, region 
I, has been suggested to be involved in Ca 2+ binding (2). It 
has been proposed that Asn-121 of this region in TAA is a 
Ca J +-binding residue (5). The other amino acids related to 
Ca 1+ binding in TAA are Asp-163, Asp- 175, and His-210 
(5). The His-210 is also involved in substrate binding and 
it is conserved in region II, but the two other amino acids, 
although quite conserved, are not located in the.conserved 
regions. Based on the structural model, the Ca J + is bound to 
BStA by side chain atoms of Asp-105, Asp-190, and 
Asp-203 and by main chain atoms (probably via carbonyl 
oxygen) of His-238 (29). The structure of the TAA and the 




Tyr-fiS. which is frw to prevent binding of the substrate to the active site. 
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models support the view that region I is necessary for 
maintaining the conformation of domain B (nomenclature 
according to Ref. 5) suitable for forming the Ca a+ binding 
site. Many of the conserved residues from regions I and II 
are involved in forming the interface between domains A 
and B. The function of the Ca 2 * could be in tightening the 
connection of the domains. Regions II and III are involved 
m forming the active site and substrate binding regions 
Based on the sequence comparisons and the modeled 
structures, we propose that Asp-234 and Asp-331 are the 
active site residues in BStA. 

Mutant Asp.331-Glu-According to our model, the 
substrate is bound to BStA between the Asp-234 and 
Asp-331 (Fig. 5). The distance between the bound sub- 
strate and the carboxyl oxygen atoms of the active site 

f^^J* ' m Mtive 3A > bu * in the mutant 

Asp-331-Glu it is only 2.2 A. The shorter distance between 
the active site residues in the mutant protein would appear 
to cause a steric hindrance and does not leave enough space 
for the substrate to bind correctly. 

This mutant protein was not analyzed because of much 
reduced activity. Maltose and maltotriose are the only 
short-chain end-products liberated from starch by this 
mutant, possibly because of a changed substrate binding 
mechanism. The reduced activity of the mutant protein is 
probably not due to low stabUity because it is not more 
active in lower temperatures (data not shown). Immuno- 
precipitation of mutant protein indicates that the gene for 
the mutant Asp-331~Glu is expressed as efficiently as the 
parental enzyme. From Fig. 2 it can be seen that the 
enzyme produced is efficiently secreted to the extracellular 
medium, which rules out the possibility of restricted 
export. 

Mutant His-238-Asp-llis.2l0 in TAA has been sug- 
gested to be involved both in substrate binding (4) and 
calcium binding (5). The sequence comparison shows that 
the counterpart in BStA is His-238, and the model suggests 
that it could indeed participate in both functions, too. It was 
replaced by Asp to introduce a charge change near the 
active site. 

Figure 5 shows the substrate binding to His-238 and 
catalytic site residues. The substrate is bound to the active 

/ l Jf ^ ft by hydr °S en b 011 ^ hydrophobic interactions 
(5). The His-238 could be one of the key interaction sites 
outside the catalytic center and may govern the proper 
orientation of the flexible substrate. The reduced specific 
activity of the mutant His-238-Asp (42% of that in the 
parental enzyme), higher amounts of starch required in 
purification to obtain binding, and shift towards shorter 
end-products (Table I) suggests that this mutant has an 
altered substrate binding to the subsites. However, the 
value of the mutant protein is essentially similar to that of 
the wild-type enzyme. This may indicate that in the mutant 
protein binding to subsites outside the active site could be 
rate limiting although the has changed only slightly. The 
mutation His- 238- Asp has reduced thermostability; activi- 
ty is almost totally lost at BQ'C in 20 min whereas the 
parental protein retains 80% of the activity even after 2 h 
under the same conditions (Fig. 3). High Ca 2 * or Na* 
concentration does not affect the thermal stability. The 
temperature optimum, however, is the same as the paren- 
tal enzyme. The mechanism of stabilization by Ca 2 *, which 
is bound to the interface of two domains, A and B, is 



271 

presumably by tightening the connection between them 
His-238 seems to interact with the cation with the main 
cham carbonyl oxygen as in TAA and PPA (5). Because 
almost any residue could provide this ligand for the Ca J+ 
binding, the side chain must also be important for the 
function of this residue. 

The negative charge of His-238-Asp causes electrostatic 
repulsion which could be compensated at least to a certain 
extent by change in the dihedral angles of the backbone and 
the side chain atoms. The lowered thermal stability of the 
mutant may be due to the reduced possibility of the main 
chain carbonyl oxygen contributing to Ca 5+ binding ^his 
leads to a more open flexible structure which could cause 
reduced thermal stability (31) by inducing a mdre unfolded 
structure (32, 33). 

Mutant A/s-232-Lys-Sequence comparisons have 
identified highly conserved Arg-232, for which no function 
has been suggested. It was therefore interesting to test the 
function of this amino acid by directed mutagenesis. The 
conservative mutation Arg to Lys was made, having the 
same length but missing the strongly basic tf-guanido 
group. 

The mutant Arg-232-Lys has about the same pH and 
temperature optimum as the parental protein but a slightly 
lower thermostability (Figs. 3 and 4). The end-product 
profile is essentially similar to the parental enzyme (Table 
I). The most important difference is lower specific activity 
only 12% of that found in the wild-type enzyme. This result 
is important because this amino acid was not previously 
thought to affect activity or substrate binding. The struc- 
tural model suggests that it is near but not bound to the 
substrate. However, it must contribute to function because 
the conservative mutation remarkably reduced the specific 
activity. 

We suggest that the Arg-232 is needed to withdraw 
Tyr-63 from the active site by ionic interaction. Sequence 
comparison shows that Tyr-63 and Arg-232 are conserved 
amino acids. The Tyr residue protrudes in a similar fashion 
into the active site region of TAA as in the structural 
models of the three Bacillus amylases. This may be due to 
the energy minimization used in the refinement of the TAA 
structure (4) f which has turned the side chain towards the 
active site cleft. The proposed substrate binding model of 
TAA does not suggest that Tyr-82 is involved in substrate 
binding in TAA. However, it does not rule out the possibil- 
ity that Tyr-63 forms a part of the "wall" of the active site 
cleft in BStA. We suggest that this residue interacts 
electrostatically with Arg-232 (Fig. 6A) so that the basic 
Arg binds to the hydroxyl oxygen of the phenyl group and 
orientates the side chain so that the active center is free to 
bind the substrate. The side chain of Arg is much more 
basic than that of Lys; thus the binding in the mutant is 
weaker and the Tyr-63 is free to rotate and so disturb the 
active center, reducing the specific activity (Fig. 6B). Also 
here the K m value is essentially similar to that for the 
parental enzyme. When Tyr-63 is bound to Lys-232 t the 
substrate binding and /Cn should be similar to the wild-type 
enzyme, but when unbound the substrate binding is hin- 
dered. 

It is possible that Arg-232 is also connected to the nearby 
Asp- 1 1 7 residue in the conserved region I. It is also possible 
that the highly basic Arg-232 could interact with both these 
residues and stabilize the structure of the enzyme by 
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decreasing the motility and flexibility of domain B. The 
mutant Arg-232-Lys cannot provide the basic guanido 
group of Arg for binding the other two side chains. Thus, the 
somewhat lowered thermal stability is probably due to less 
effective binding, which permits increased flexibility. 

This work supports the idea that the identification and 
study of conserved amino acids by sequence comparison, 
even in the absence of known tertiary structure, can lead to 
useful insights on the function and structure of proteins. 

We are grateful for Dr. Tuula Teeri and Dr. Leif Laaksonen for 
establishing a computer aided molecular modeling facility at VTT. 

REFERENCES 

1. Vihinen, M. & Mantsala, P. (1989) Cril Rev. Biochem. Mol. BioL 
24, 329-418 

2. Rogers, J.C. (1985) Biochem. Biophys. Res. Commun, 128, 470- 
476 

3. Mackay, R.M., Baird, S., Dove, M.J., Erratt, J.A., Gines, M., 
Moranelli, F., Nasim, A., Willick, G.E., Yaguchi, M. ( & Seligy, 
V.L. (1985) BioSystems 18, 279-292 

4. Matsuura, Y., Kusunoki, M., Harada, W., & Kakudo, M. (1984) 
J. Biochem. 95, 697-702 

5. Buisson, G., Duee, E., Haser, R., & Payan, F. (1987) EMBO J. 6, 
3909-3916 

6. Suominen, I., Karp, M., Lautamo, J. f Knowles, J., & Mantsala, 
P. (1987) in Extracellular Enzymes of Microorganisms (Chaloup- 
ka, J. & Kmmphanzl, J., eds.) pp. 129-137, Plenum Press, New 
York 

7. Suominen, I., Karp, M., Lahde, M., Kopio, A., Glumoff, T., 
Meyer, P., & Mantsala, P. (1987) Gene 61, 165-176 

8. Suominen, I., Kfipyla", J., Tilgmann, C. ( Glumoff, V., & MantsSla, 
P. (1988) FEMS Microbiol Lett 55, 3-8 

9. Reinikainen, P., Lahde, M., Karp, M., Suominen, I., Markkanen, 
P., & Mantsala, P. (1988) Biotech. Lett 10, 149-154 

10. Vieira, J. & Messing, J. (1982) Gene 19, 259-268 

11. Casadaban, M.J. & Cohen, S.N. (1980) J. Mol. BioL 138, 179- 
207 



12. Kunkel, T.A., Roberts, J.D., & Zakour, R.A. (1987) Methods 
EnzymoL 154, 367-382 

13. Yanisch-Perron, C., Vieira, J., & Messing, J. (1985) Gene 33 
103-119 

14. Maniatis, T., Fritsch, E.F., & Sambrook, J. (1982) Molecular 
Cloning. A Laboratory Manual, Cold Spring Harbor Laboratory 
Cold Spring Harbor, N. Y. 

15. Cookson, C. (1978) Anal. Biochem. 88, 340-343 

16. Zagursky, R.J., Berman, M.L., Baumeister, K., & Lomax N 
(1985) Gene Anal Tech. 2, 89-94 

17. Norgaard t M,V.,Keem,K.,&Monahan,J.J. (1978)Ge/ie3 279- 
292 

18. Gryczan, T.J., Contente, S., & Dubnau, D. (1978) J. Bacterial. 
134, 318-329 

19. Kunkel, T.A. (1985) Proc Natl. Acad. Sci. U.S. 82, 488-492 

20. Anagnostopoulos, C. & Spiziien, J. (1961) J. Bacterid 81 741- 
746 

21. Minsky, A., Summers, R.G., & Knowles, J.R. (1986) Proc Natl 
Acad, Scu U.S. 83, 4180-4184 

22. Smith, T.F. & Waterman, M.S. (1981) Adv. AppL Math. 2, 482- 
489 

23. Needleman, S.B. & Wunsch, CD. (1970) J. Mol BioL 48, 443- 
453 

24. Devereux, J., Haeberli, P., & Smithies, O. (1984) Nucleic Acids 
Res. 12, 387-395 

25. Vihinen, M. (1988) CompuL AppL BioscL 4, 89-92 

26. Yuuki, T., Nomura, T., Texuka, H. t Tsuboi, A., Yamagata, H., 
Tsukagoshi, N., & Udaka, S. (1985) J. Biochem. 98, 1147-1156 

27. Takkinen, K. ( Pettersson, R.F., Kalkkinen, N„ Palva, I., Sader- 
lund, H., & KaanSinen, L. (1983) J. BioL Chem. 258, 1007-1013 

28. Jones, T.A. (1978) J. AppL CrysL 11, 268-272 

29. Vihinen, M. & Mfintsfila, P. Biochem. Biophys. Res. Commun. in 
press ; 

1 30. Nakajima, R., Imanaka, T., & Aiba, S. (1986) AppL Microbiol 
Biotech. 23, 355-360 

31. Vihinen, M. (1987) Prot Eng. 1, 477-480 

32. Tomazic, S.J. & Klibanov, A. (1988) J. BioL Chem. 263, 3086- 
3091 

33. Tomazic, S.J. & Klibanov, A. (1988) J. BioL Chem. 263, 3092- 
3096 



J. Biochem. 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

Ig-BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

IB COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE^) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: : ■ 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



